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' In astronomy, interferometry of light collected by separate telescopes is often per- 



formed by physically bringing the optical paths together in the form of Young's 
double-slit experiment. Optical loss severely limits the efficiency of this so-called 
direct detection method, motivating the fundamental question of whether one can 



CN ■ achieve a comparable performance using separate optical measurements at the two 

> _ 

telescopes before combining the measurement results. Using quantum mechanics and 



estimation theory, here I show that any such spatially local measurement scheme, 



such as heterodyne detection, is fundamentally inferior to coherently nonlocal mea- 

o ; 

surements, such as direct detection, for estimating the mutual coherence of bipartite 
thermal light when the average photon flux is low. This surprising result reveals an 
overlooked signature of quantum nonlocality in a classic optics experiment. 

PACS numbers: 03.65.Ud, 42.50.Ar, 95.55.Br 



The basic goal of stellar interferometry is to retrieve astronomical information from the 



Rial- 



The imaging reso- 



mutual coherence between optical modes collected by telescopes 
lution increases with the distance between the collected optical modes called the baseline, 
motivating the development of long-baseline stellar interferometry using light collected from 
a telescope array 0, [J. The standard method of stellar interferometry in the optical regime 
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is called direct detection, which coherently combines the optical paths in the form of the 
classic Young's double-slit experiment, but its efficiency suffers from decoherence in the 
form of accumulating optical loss along the paths as the baseline is increased. To avoid 
optical loss, an alternative method is to perform separate heterodyne detection at the two 
telescopes, before combining the measurement results via classical communication and data 
processing In quantum information theory, direct detection can be classified as a 

nonlocal measurement scheme, which requires joint quantum operations on the two optical 
modes, while heterodyne detection is a local measurement scheme, which does not require 
quantum coherence between the separate detectors {4], 5]. Townes has previously analyzed 
the quantum noises in direct and heterodyne detection and concluded that direct detection 
is superior at high optical frequencies and heterodyne detection is superior at low frequencies 
(3), Q]. Heterodyne detection is, however, only one example of local measurements, and it 
remains a fundamental and important question whether any other local measurement can 
perform as well as nonlocal measurements while not suffering from decoherence. 

The main purpose of this Letter is to prove that, in the case of weak thermal light, any 
local measurement scheme must be significantly inferior to a nonlocal one for the estimation 
of the mutual coherence according to quantum mechanics. This is a surprising result in 
quantum metrology, since the disadvantage of local measurements does not otherwise oc- 
cur for coherent states at any strength, a well-studied case in quantum metrology [7] , strong 
thermal light, in which case there is little difference between direct and heterodyne detection 
8], or even the single-photon state assumed by Gottesman, Jennewein, and Croke in their 
proposal of shared-entanglement stellar interferometry [9] . This quantum measurement non- 
locality can be regarded as a dual of Einstein- Podolsky- Rosen entanglement (4, 10]: Despite 
the fact that bipartite thermal light has a well-defined classical description and possesses no 
quantum entanglement, nonlocal quantum measurements are necessary to extract the most 
information from the light. For optical interferometry and imaging applications in general, 
the result demonstrates the fundamental advantage of nonlocal measurements for weak ther- 
mal light and motivates the development of coherent optical measurement techniques, such 
as integrated optical information processing I12I ] and entanglement sharing . 

Consider the estimation of first-order spatial coherence (g^) between two distant optical 
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FIG. 1: (Color online). Schematics of (a) the direct detection scheme, an example of nonlocal 
quantum measurement, and (b) a local measurement scheme, which performs spatially separate 
measurements and permits only classical communication and control between the two sites. Ex- 
amples of the latter include heterodyne and homodyne detection. 

modes. In quantum optics, bipartite thermal light is described by the density operator 

p= [ d 2 ad 2 (3^(a,(3)\a,(3)(a,(3\, (1) 



where \a,(3) is a coherent state with amplitudes a and f3 in the two modes and is 



the Sudarshan-Glauber representation 



lfl, given by 



d>(a,/3) 



1 



7T 2 det T 

T is the mutual coherence matrix: 



cxp 



- (a* /?*) 




(2) 



r = 



F F 

L aa 1 ab 



(3) 



[a'a) (tfa) 

r,„, r,,,, / \ (rib) (tfb) 

and a and b are annihilation operators of the optical modes. The zero-mean Gaussian 
statistics are a standard assumption for astronomical sources in theoretical optics 
The positive $ function indicates that the two modes are classically correlated only and 
possess no quantum entanglement [131 ] . 

Let (a) a) = (b^b) = e/2 for simplicity. For an incoming light with photon- flux spectral 
density S(v) and a relatively narrow detector bandwidth Au around a center frequency 
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Uq, the filtered photon flux is S(u )Au. Over the duration of the effective temporal mode 
At ~ e = 5*(z/o)Az/At ~ 5 , (z/ ) turns out to be independent of the detector bandwidth 

and a function of the source and the telescope efficiency only. Considering the case e -C 1, as 
is common for interferometry with high optical z/o, the density operator can be approximated 
in the photon-number basis as 

p = (1 - e)|0,0)(0,0| + |[|0, 1)(0, 1| + |1,0)(1,0| 

+ 2*|0,i>(l,0|+2|l,0>(0,i|]+O(e 2 ), (4) 

where I have defined eg/ 2 = T ab = Tl a and g = g\ + ig 2 as the complex degree of coherence 
with \g\ < 1 In the following, I neglect the small 0(e 2 ) terms, assume that e is known, and 
gi and g 2 are the unknown parameters to be estimated. The assumption of a known e should 
be reasonable, as other non-interferometric imaging methods can be used to estimate the 
average photon flux and are usually much less sensitive to noise • Otherwise e should also 
be regarded as an unknown parameter to be estimated by the interferometer, a complication 
outside the scope of this Letter. 

Any measurement in quantum mechanics can be modeled by a positive operator-valued 
measure (POVM) E(y) 0, [jjj], which determines the probability of the observation y: 

P(y\g) = tr [E(y)p\. (5) 

For example, in the direct detection scheme (Fig. [11(a)), the two optical modes are brought 
to interfere at a 50-50 beam splitter and the photons at the two output ports are counted. 
It can be shown by standard quantum optics calculations |S|] that the POVM E(n, m) for 



photon counts n and m are 

£(0,0) = |0,0>(0,0|, (6) 

E(l, 0) = \ (|1, 0} + e" i5 |0, 1)) ((1, 0| + e t5 (0, 1|) , (7) 

E(0, 1) = \ (|1, 0) - e-* 5 |0, 1)) ((1, 0| - e i5 (0, 1|) , (8) 
where 5 is an adjustable phase shift on the b mode. The observation probabilities become 

P(0,0|s) = l-e, (9) 

P(l,0|^) = ^[l + Re(^- i5 )], (10) 

P(0,l\g)= 6 -[l-Re(ge- iS )]. (11) 
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To evaluate the parameter-estimation capability of a measurement scheme, consider the 



Fisher information matrix, defined as 



15| 



P(y\g) 



D(y\g), 



\dP{y\g) 



D(y\g) 



dP(y\g) dP(y\g) 



[ 991 
dP(y\g) dP(y\g) 
9gi dgi 



dgi 



9g2 



9P(y\g) 
9g2 



(12) 



(13) 



The inverse of the Fisher information matrix provides a lower Cramer-Rao bound to the 
mean-square estimation error covariance matrix E for any unbiased estimate in the form of 
S > F . The eigenvalues of F, which must be nonnegative as F > 0, hence quantify the 
amounts of independent information obtainable from the measurement. In a total observa- 
tion time interval T over which the model parameters can be approximated as time-constant, 
M ~ T/At ~ T Av measurements can be performed, and the total Fisher information is 
_p( M ) = MF ~ TAvF. In the limit of large M, the Cramer-Rao bound is asymptotically 
achievable by maximum-likelihood estimation. This makes the Fisher information a rig- 
orous metric for comparing the inherent capabilities of different measurement schemes for 
parameter estimation. 

The Fisher information for direct detection is 



1 — Re(ge~ 



-iS\2 



and the eigenvalues of F are 
Ai = 0, 



cos 2 S sin 5 cos 5 
sin 5 cos 5 sin 2 5 



A, 



(14) 



(15) 



1 - Re(ge~ i5 ) 2 ' 

The zero eigenvalue corresponds to the absence of information about the unobservable 
quadrature Im(ge~ 4<5 ) . In practice, 5 is varied over measurements to retrieve information 
about both quadratures of g. The important point to note here is that = Ai + A 2 > e 
if we take the trace norm. The norm of the total Fisher information for M measurements 
becomes ||F( M )|| = M||F|| > Me, which scales linearly with the average photon num- 
ber Me, thereby achieving the optimal "shot-noise" scaling for parameter estimation using 
classical states 16|. Similarly, it is shown in js] that the Fisher information for the shared- 
entanglement scheme proposed by Gottesman, Jennewein, and Croke has in theory the 
same expression but reduced by a factor of 2. 
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Both of the aforementioned schemes can be considered as nonlocal quantum measure- 
ments, which require bringing the two modes together physically or sharing entanglement 
between the two sites. The physical nonlocality makes such schemes increasingly challenging 
to implement technically as the distance between the two modes increases, primarily due 
to accumulating decoherence in the form of optical loss along the paths [2[. Local mea- 
surement schemes, on the other hand, measure the two modes separately before combining 
the results via classical communication (Fig. QJb)), and can therefore be implemented over 
a much greater distance in principle. To investigate the general performance of any local 
measurement, let us write the observation probability distribution for POVM E(y) explicitly 
as 

P{y\g) = (1 - e)E 00t00 (y) + | [E 01;01 (y) 

+ £io,io(2/) + 2|£i ,oiQ/)| Refce-*)], (16) 

where 

E nrn ,n'm'(y) = (n,m\E(y)\n',m') (17) 

and 5 is the phase of £10,01 • To put a bound on the Fisher information given by Eq. ffl2|) . 
note that 

P(y\g)>(l-e)E 00 , 00 (y), (18) 
and the positive-semidefinite matrix 

„, . . l9 / cos 2 5 sin 5 cos 5 \ / 
D = e 2 \E Wfil (y)\ 2 I (19) 

\ sin 5 cos 5 sin 5 I 

defined in Eq. ffl3|) has a trace norm given by e 2 \E W01 (y)\ 2 . Applying the subadditivity 
property of matrix norms to Eq. (fl2l) results in an upper bound on 

\E Wfi i(y)\ 2 

y 

For generality, I define local measurements as the ones performed using local opera- 
tions with classical communication (LOCC), which permits the measurement at one site 
to be conditioned upon the observation at the other site. A necessary condition for a 
spatial-LOCC POVM is the positive-partial-transpose condition E Ta (y) > [l^]. By the 



|F[|<— y 1 ' 0,01 r ■ (20) 
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Cauchy-Schwarz inequality, | (1, 0\E\0, 1) | 2 = |(0, 0\E Ta \l, 1)| 2 = | (0, 0| y/W^y/E^\l, 1) | 2 < 
(0,0|£ r «|0,0)(l,l|£ r «|l,l) = (0,0|£|0,0)(1,1|£|1,1), or 

|^io,oi(l/)r<^oo I oo(l/)^ii I u(l/)- (21) 
Combining Eqs. ([20]) and fl2T|) . I obtain an 0(e 2 ) upper bound on ||-F||: 

IIFI 



e 2 „ , , e 2 



< T ^E^^) = T^' ( 22 ) 

y 

where £11,11 (y) = 1 comes from the completeness property of a POVM. The neglected 
0(e 2 ) term in the density operator in Eq. (j3J) contributes an additional 0(e 2 ) term to P and 
an 0(e 3 ) term to D, so the Fisher information would be modified by an 0(e 3 ) term and the 
upper bound in Eq. ( 122]) should be rewritten as 

\\F\\ < e 2 + 0(e 3 ). (23) 

For M measurements, the bound can be generalized to allow for adaptive measurements 
conditioned upon previous observations, as shown in Ref. 8|: 

||F( M )||<M[e 2 + 0(e 3 )]. (24) 

This upper bound shows that the best Fisher information any spatiotemporal-LOCC mea- 
surement can achieve is still substantially worse than that of the spatially nonlocal methods 
(||_p( M )|| rsj Me) when e « 1. In other words, spatially local measurements are funda- 
mentally much less efficient than nonlocal methods in extracting coherence information 
from weak-thermal-light interferometry. This general proof is supported by explicit Fisher- 
information calculations for heterodyne and homodyne detection Q], signal-to-noise-ratio 
calculations for direct and heterodyne detection of the full thermal state given by Eq. (pQ) 
and the known fact in astronomy that direct detection performs better than heterodyne 



detection for high optical u |3j, |6j. Ref. js] also includes a discussion of the quantum origin 
of the nonlocality in terms of the semiclassical photodetection picture. 

Note that the advantage of nonlocal measurements is lost for coherent states, strong 
thermal light with e> 1 or even the nonclassical single-photon state studied in Ref. {9]. 
For coherent states, \g\ — 1 and the unknown parameters are the phases of the two optical 
modes in a product of coherent states, in which case it can easily be shown that nonlocal 
measurements are not necessary, analogous to the case of single-parameter phase estimation 
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with a product state [18j. For strong thermal light with e > 1, calculations in Ref. |8j show 



is 



that the performances of direct detection and heterodyne detection converge and suggest 



that the noise in this regime is dominated by the thermal statistics o 



the source rather than 



the detection statistics. The single-photon state studied in Ref. {9] can also be analyzed 
using the formalism here by omitting 0(e 2 ) terms and then putting e = 1, resulting in 
comparable performances for local and nonlocal measurements. 

The peculiar existence of quantum nonlocality for weak thermal light, as a property of 
bipartite measurements applied to certain separable states, can be regarded as a dual of 



Einstein-Podolsky- Rosen entanglement [4|, |10], a property of bipartite states that can pro- 



duce higher correlations in certain separable measurements. In the context of quantum 
communication theory, it is well known that nonlocal measurements can extract more in- 



formation from states with no entanglement {4, 5, the result here provides a striking 
example in which the same type of quantum nonlocality readily exists for observers extract- 
ing information from nature. 

For practical applications, the result here demonstrates the fundamental advantage of 
nonlocal quantum measurements for weak-thermal-light interferometry and may have further 
implications for optical imaging systems, such as compound-eye imaging and fluorescence 
microscopy The shared-entanglement proposal in Ref. |9| requires a path-entangled 

single-photon source and quantum repeaters, both of which are unlikely to become feasible 
in the near future, but standard linear optics can also perform nonlocal measurements by 
coherently processing multiple optical modes before detection, provided that optical loss 
can be minimized. In the short term, the result here thus motivates the development of 
low-loss coherent optical information devices, such as photonic cry stal fibers and integrated 
photonics, for thermal-light interferometry and imaging 

Accurate coherence information can be obtained only in the limit of many collected pho- 
tons. This corresponds to measurements of many copies of the quantum state. A more 
general quantum measurement strategy than the ones considered here involves joint quan- 
tum operations on the multiple copies before measurements. This kind of temporal non- 
locality is not needed for parameter estimation when spatially nonlocal measurements can 



be performed [16j. It remains an interesting open question whether coherent temporally 
nonlocal strategies can offer any significant advantage when one is restricted to spatially 
local measurements. Other potential generalizations include time-varying parameters and 
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the estimation of temporal coherence for spectroscopy in addition to spatial coherence. One 
must then take into account the dynamics of the source and colored noise, which can be 
analyzed using the quantum waveform estimation framework developed in Refs. {20 1. 
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man, Carlton Caves, Howard Wiseman, Alexander Lvovsky, Christoph Simon, Mohan 
Sarovar, Alexander Tacla, Jiang Zhang, Matthias Lang, Shashank Pandey, and Stefano 
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Supplementary Material 

This Supplementary Material contains supportive calculations and discussions that com- 
plement the main text. Section |X] derives the positive operator-valued measure (POVM) 
for direct detection, Sec. [B] derives the POVM for shared-entanglement interferometry and 
the resulting Fisher information, Sec. O calculates a bound on the total Fisher information 
for multiple adaptive measurements, Sees. [D] and [E] calculate the Fisher information for 
heterodyne and homodyne detection, Sec. IF1 investigates the performances of direct and het- 
erodyne detection for strong thermal light, and Sec. [G] discusses the origin of the quantum 
nonlocality in terms of the semiclassical photodetection picture. The reference list at the 
end is identical to the one in the main text for easier cross-referencing. 

Appendix A: POVM for direct detection 

In direct detection, the optical modes a and b are combined by a beam splitter and 
photon-counting is performed at the two output ports. Let U be the unitary operator that 
corresponds to the operation of the beam splitter on the bipartite quantum state p. The 
observation probability distribution is 

P(n,m\g) = (n,m\UpU'\n,m), (Al) 
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where \n,m) is a Fock state. We can then write the POVM as 

E(n,m) = U'\n, m)(n, m\U, (A2) 

which propagates the Fock-state projection back to the time when the state of light is p. 

With at most one photon in the quantum state, we are interested in (n, m) = 
(0, 0), (1, 0), (0, 1) only. Applying the unitary to the Fock states, 

^|0,0> = |0,0>, (A3) 

C/ f |l,0) = W<JUW\0,0) = WdU\0,0) (A4) 

= _L( a t + e -^t)| 0;0) (A5) 

= 4(|l,0> + e-* 5 |0,l», (A6) 



V2 

f/t|o,l) = i=(|l,0)-e-^|0,l)). (A7) 



The POVM is hence 



£(0,0) = |0,0)(0,0|, (A8) 
E(l, 0) = l - (|1, 0) + e- 5 |0, 1)) ((1, 0| + e i5 (0, 1|) , (A9) 

E(0, 1) = \ (|1, 0) - e-* 5 |0, 1)) ((1, 0| - e l5 (0, 1|) . (A10) 



Appendix B: Shared-entanglement interferometry 

Assuming an entangled ancilla in two modes c and d given by 

|5) = -L(|0,l) Cid + e i5 |l,0) Cjd ), (Bl) 

with each mode sent to the sites of a and b modes for separate interference measurements 
9), the POVM for photon counts (n,m,n',m') is 

E(n,m,n',rn') = (6\Ul c ®Uljn,m,n\m'}(n,m,n',m'\U ac ®U bd \5), (B2) 

where U ac denotes the beam-splitting unitary on modes a and c and Um denotes the same 
unitary on modes b and d. The calculation is more involved but similar to the one for direct 
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detection. The final result is 



£M = |0,0>(0,0| 

1, 



E(yi 



|0,1)(0,1|, 



E(y 2 ) = -\1,0)(1,0\, 

E(y 3 ) = l(|l,0) + e- i5 |0,l)) ((l,0| + e"(0,l|) 



1 



E( 2/4 ) = i (|i,o)- e - l5 |o,i))((i,o|- e j5 (o,i|) 



(B3) 
(B4) 

(B5) 

(B6) 

(B7) 



where each yj corresponds to a set of (n,m,n' ,m') that produce the same POVM. When 
applied to the quantum state p, only observations y^ and y± contribute to the Fisher infor- 
mation about g. Since E(y$) = E(l, 0)/2 and E(y±) = E(0, l)/2, the Fisher information for 
shared-entanglement interferometry is simply that for direct detection reduced by a factor 
of 2: 

cos 2 5 sin 5 cos 5 , 

(B8) 



2[1-Be{ge iS ) 2 } \ s i n( 5cos5 sin 2 S 



Appendix C: Adaptive measurements 



For M measurements, the joint observation probability distribution can be written as 

P(VM,---,Vi\g) = P(vm\9,Vm-i,---,Vi)P(Vm-i,---,Vi\9)- (CI) 

Each element of the total Fisher information matrix for M measurements, using an alternate 
form of the Fisher matrix 15| . becomes 



F (M) = 



(J 2 



\nP(y M ,...,yi\g) 



V P(y M ,---,yi\g)-r, 
„ d 9jdg k 

1,--;VM 

^2 p (vM\g,yM~h ■ ■ -,yi)P{yM-i, ■ ■ -,yi\g) 



x 



Vi,—,Vm 

d 2 

dgjdg k 



lnP(y u \g,y M -i, ...,yi) + 



2 



dgjdg k 



In P(y 



M-l, 



yi\g) 



P(vm-u ■ ■ ■,Vi\g)F M 3k(VM-x, ■■■,Vi)+F i 



(M-l) 
jk ' 



Vl,—,VM-1 



where Fm denotes the conditional Fisher information of the Mth measurement: 



F M jk(yM-i, ...,yi) = -^T / P{y 



m\9, yM-i, 



-2/1, 



d 2 



UM 



dgjdg k 



(C2) 



(C3) 
(C4) 



\nP(y M \g,y M _ 1 ,...,y 1 ). (C5) 
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Applying the subadditivity property of matrix norms, 

|| F (M)||< J- P(yM-i,...,vi\g)\\F M \\ + \\F&- i >\\ (C6) 



yi,— ,vm-i 



< max \\F M \\ + \\F( M -V\\, (C7) 

2/1,— ,Vm-\ 



and by induction, 

M 

||F (M) ||<V max ||F m ||. (C8) 

m=l 

This proves that the norm of the total Fisher information cannot exceed the sum of the 
maximized single-measurement values. 

For the mth quantum measurement with outcome y m conditioned upon previous obser- 
vations, we can write 

P(y m \g, Vm-i, ■ ■ ■ , Vi) = tr [E(y m \y m _ h . . . , yx)p] . (C9) 

This means that the bound given by Eq. (23) in the main text for spatial-LOCC measure- 
ments in the case of e 1 is also applicable to ||-F m ||: 

||i^|| <e 2 + 0(e 3 ). (CIO) 

The total Fisher information is hence bounded by 

\\F^\\ < M [e 2 + 0(e 3 )] , (Cll) 

which generalizes the bound to the case of multiple spatiotemporal-LOCC measurements 
and proves that no adaptive strategy can improve the scaling ||F( M )|| ~ Me 2 . 

Appendix D: Heterodyne detection 



The POVM for heterodyne detection is 



H 



E ip^) = —r\n,v){fj,,v\, (Dl) 



7T 2 
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where v) is a coherent state and the normalization is f d 2 fid 2 uE(fi, v) = I, the identity 
operator. The relevant POVM matrix elements are 



1 



£ 00i00 (/i,z/) = -|(0,0|/i,z/>r (D2) 

1 



7T 



exp (-|/i| 2 - |z/| 2 ) , (D3) 



Soi,oi(//^) = ^|(0,1|//^)| 2 (D4) 



_L e xp(-|/i| 2 -|H 2 )|H 2 , (D5) 
1 



^io,io(^^) = ^|(l,0|/i,z/)| 2 (D6) 



7T 



1 exp(-\^\ 2 -\u\ 2 )\fi\ 2 , (D7) 



7T 2 



^exp^l^l 2 - |z/| 2 )/iV. (D9) 



The Fisher information is hence 



e 2 /10, ; 



F=^\ \+0(e i ), (D10) 



2 



1 



= e 2 + 0(e 3 ). (Dll) 

This shows that the performance of heterodyne detection is already the optimum allowed 
by quantum mechanics for any local measurement according to the bound given by Eq. (23) 
in the main text. 

Appendix E: Homodyne detection 

For homodyne detection, 

E(x,y) = \x,y){x,y\, (El) 

where \x,y) is a quadrature eigenstate: 

-^-= (ae~ i5a + a)e i&a ) \x, y) = x\x, y), (E2) 
V2 

_L (be-ih + 6 t e «6) |^ y) = y\ Xi y)) ( E3 ) 
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5 a and 5b are local-oscillator phases, and the normalization is f dxdyE(x,y) = I. The 
relevant POVM elements are 

£00,00(2:, y) = i exp (-x 2 - y 2 ) , (E4) 
2 

£01,01 (x, y) = - exp (-x 2 - y 2 ) y 2 , (E5) 
2 

£10,10(2;, y) = - exp (-x 2 - y 2 ) x 2 , (E6) 

7T 

£10,01 (x, y) = -e iS exp (-x 2 - y 2 ) xy, (E7) 
where 5 = 5 a — Sb- The Fisher information becomes 

„ / cos 2 5 sin 5 cos 5 \ / Qs 
F = e 2 +0(e 3 ), (E8) 

\ sin 5 cos 5 sin / 

||F|| = e 2 + 0(e 3 ). (E9) 

Homodyne detection is also able to saturate the bound given by Eq. (23) in the main text, 
but each measurement gives information about only one quadrature of g and 5 should be 
varied over measurements to estimate both quadratures. 

Qualitatively, the inferior Fisher information for heterodyne and homodyne detection 
can be attributed to the non-zero vacuum fluctuations even when no photon is coming in 
to provide information about the unknown parameters. Nonlocal measurements are able to 
perfectly discriminate against this case and discard the useless observations, but heterodyne 
or homodyne detection is unable to do so and forced to include vacuum fluctuations as 
potentially useful observations, resulting in a substantially worse estimation accuracy in the 
long run. 



Appendix F: Thermal light with arbitrary e 

For e > 1, it is necessary to use the full thermal state given by Eq. (1) in the main text. 
First consider the observation probability density of heterodyne detection: 

P(/i, V \g) = J d 2 ad 2 (3U(fx, u\a, /?)$(«, /%), (Fl) 

U(fi,u\a,(3) = (a,(3\E(fx,u)\a,P) (F2) 
= ^exp(-|/i-«| 2 - \u-f3\ 2 ) . (F3) 
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We can interpret these expressions using a semiclassical photodetection picture jlj: The 
heterodyne detection statistics obey II(/z, z/|a, 0) for given classical fields (a, (3), but the 
fields from the source also have a statistical distribution given by so the marginal 

observation density is taken to be II averaged over $. The resulting convolution of the two 
Gaussians can be calculated analytically and given by 



p( ^^ ) = ^deTr exp 

where the new covariance matrix V is 

r' = r + 



r 




(F4) 



: 


0^ 


H 









e/2 + 1 eg/2 
eg* 72 e/2 + 1 



(F5) 



The factors of 1 come from IT and represent detection noise. When e ^ 1, II is much 
sharper than $, so the convolution essentially reproduces <3> as the marginal observation 
density and P(fi,u\g) « z/|p). In other words, the inherent thermal noise from the 
source overwhelms the heterodyne detection noise in the e > 1 regime. 

To estimate the performance of heterodyne detection, we can consider the signal-to-noise 
ratio (SNR) {3, 6]. If we take ±iv* as the output signal, (/i^*) = eg/2 directly gives g on 
average, and the signal energy is 



S=\(pu*)\ 



^2112 
2 e |y| 



4 



(F6) 



det G — G aa Gt>b — G a bGbai 



The noise energy is 

N = (\fiu*\ 2 ) - S. (F7) 

The fourth-order field statistics can be computed with the help of the matrix G = T'^ 1 : 

(F8) 
(F9) 
(F10) 
(Fll) 

X = i 1 - t ! • (F12) 



(\fiis*\ 2 ) = detG 



d 2 



dG aa dG bb detG 
2G aa Gbb _ 1 
det G 2 detG 
e\ 2 e 2 \q\ 2 
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The SNR is hence 



e 2 \g\ 2 



(F13) 



N (2 + e) 2 ' 

For e < 1, S/N w e 2 | 2 /4, but for e > 1, the SNR saturates to S/N -» \g\ 2 and becomes 
independent of e. 

For direct detection, 



\ g ) = J d 2 ad 2 /3U(n, m\a, P)$(a, 0\g), 

Il(n, m\a, (3) = (a, j3\E(n,m)\a, j3) = \{n, m\u, v )| 2 , 

a \ l <\ \ 1/1 e iS 

\= V \ J ' 



\ 1 -e iS 



(F14) 
(F15) 

(F16) 



Changing the integration variables to (u,v), 

P(n, m\g) — J d 2 ud 2 vU.'(n, m\u, v)<&'(u, v\g), 



U.'(n,m\u,v) = exp(— \u\ 2 ) 

detK 
$ {u,v\g) = — 5— exp 

7T^ 



K = VT~ l V ] = 



\u 



2n 



■ exp(— |t>| 2 ) 



\v 



2m 



U* V* ) K 



ml 

u 

V 



det K = K aa K bb - K ab K ba 



l-Re(ge~ iS ) ilm(ge~ iS ) 
e(l-\g\ 2 ) y -ilm(ge- iS ) l + Re(ge- iS ) 
4 



(F17) 
(F18) 

(F19) 

(F20) 
(F21) 



e 2 (l-|<?l 2 )' 

The averaging of a Poissonian with a Gaussian is difficult to calculate exactly. For e ^> 1, 
however, the photon counts (n,m) should be large most of the time, and P(n, m\g) may 
be approximated by a Gaussian. Let us therefore focus on the first and second moments of 
(n,m) for P(n, m\g). The first moment of n is 



= — det K 



d 2 ud 2 v\u\ 2 &(u,v\g) 
d 1 



dK aa detK 



(F22) 

(F23) 
(F24) 
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Similarly, 



This gives 



w = - detJf dkap (F25) 

<" 2 ^<" >+detA 'Adi7f < F26) 

< m »> = W+ det^ s L f , (F27) 
<9 2 1 

H = det K ^ ~„ i — 77 • (F28) 

x 7 dK aa dK bb det K v ; 



(n> = | [1 + Re(<?e^)] , (F29) 
(m> = | [1 - Re(ge^ 5 )} , (F30) 



(An 2 ) = (n) + (n) 2 , (F31) 
(Am 2 ) = (m) + (m) 2 , (F32) 

(AnAm) = - lm{ge-' t5 ) 2 . (F33) 

A behavior similar to the case of heterodyne detection can be seen here. For (n), (m) ~e> 
1, the noise covariances scale as e 2 rather than e, indicating that the source thermal noise 
also overwhelms the Poissonian detection noise. 

Since the observation statistics are expected to be approximately Gaussian for e ^> 1, we 
can similarly consider the SNR as a performance metric. Taking the output signal asn-m, 
the average of which gives (n — m) = eRe(ge^), a quadrature of g, the signal energy is 

S = ( n - m) 2 = e 2 Re^e"* 5 ) 2 , (F34) 

and the noise energy is 

N= ((n-m) 2 ) -S (F35) 

= (An 2 ) + (Am 2 ) - 2 (AnAm) (F36) 

= 6 + t [1 + Re(ge-y - Imfee"") 2 ] • (F37) 

If we perform two measurements, one with S = Si and one with S = Si + it/ 2 to measure the 

other quadrature of g, the average signal and noise energies per measurement becomes 

- e 2 |o| 2 

S = -f-, (F38) 
N = e+ 6 -, (F39) 
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and the average SNR is 

3 _ eM 2 
N 2 + e' 



(F40) 



For e> 1, the SNR saturates to \g\ , just like the SNR of heterodyne detection, suggesting 
that the SNR is dominated by source thermal noise regardless of the detection method and 
nonlocal measurements do not have an advantage when e ^> 1. 

For e C 1, the SNR is still a valid performance metric for a large number of measurements, 
in which case the statistics become approximately Gaussian by the central limit theorem and 
averaging M observations improves the final SNR by a factor of M. The direct-detection 
SNR is « Me\g\ 2 /2 and significantly better than the heterodyne SNR « Me 2 |g| 2 /4, a fact 
well known in astronomy [j, (J and rigorously generalized in this paper. 



Appendix G: Quantum origin of measurement nonlocality in the semiclassical 

photodetection picture 

One may well wonder where quantum mechanics comes in, if both $ and II are nonneg- 
ative and the whole problem obeys classical statistics in the semiclassical photodetection 
picture. The answer lies in the fact that the likelihood function 

U(y\a,f3) = (a,f3\E(y)\a,(3) (Gl) 

cannot be an arbitrarily sharp probability distribution in quantum mechanics. It is the 
Husimi representation jj, more commonly applied to a quantum state but here to a POVM. 

If we regard U(y\a,/3) as a likelihood function of (a, (3) for a given observation y, the 
sharpness of H(y\a,{3) with respect to (a, (3) in phase space characterizes the amount of 
information about (a, (3) contained in the observation y. The Husimi representation has a 
maximum magnitude and a finite variance for each quadrature, which means that there is a 
limited amount of information about the fields that an observation can provide. 

The information of mutual coherence lies only in the nonlocal second-order field cor- 
relation a(3* for thermal light, the first-order mean fields of which are zero. If E(y) cor- 
responds to a local measurement and is separable into E a (y) ® Ef,(y), the sharpness of 
H(y\a,{3) = H a (y\a)Ilb(y\fl) with respect to a(3* would be more limited than that allowed 
by nonlocal measurements, meaning that local measurements extract less information about 



20 



the coherence than nonlocal measurements. In this sense, the measurement nonlocality 



lOj ; the former is a 



can be regarded as a dual of Einstein-Podolsky-Rosen entanglement 
property of bipartite measurements that can extract more information from certain separa- 
ble states and the latter a property of bipartite states that can produce higher correlations 
in certain separable measurements. 



